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Abstract. It is now well-known that, with high probability, the additivity of minimum output 
entropy does not hold for a pair of a random quantum channel and its complex conjugate. 
We investigate asymptotic behavior of output states of r-tensor powers of such pairs, as the 
dimension of inputs grows. We compute the limit output states for any sequence of well-behaved 
inputs, which consist of a large class of input states having a nice set of parameters. Then, 
we show that among these input states tensor products of Bell states give asymptotically the 
least output entropy, giving positive mathematical evidence towards additivity of above pairs 
of channels. 



n | 1. Introduction 

In this paper, we investigate the asymptotic limits of output states of (<£ (g) <&)® r , where is 
a random quantum channel whose input space grows. This kind of pair of a quantum channel 
and its complex conjugate, (g) <!>, which we shall call conjugate pair, has been known to violate 
the additivity of minimum output entropy in the following sense: 

Srnin($ ® $) < Smin($) + 5Un(*) 

This is generically true when the dimensions of concerned spaces are large in a certain regime. 
Here, the minimum output entropy 5 m i n (-) is defined as S , m i n ( < ^) = min^ S(Q(p)), where S(-) 
is the von Neumann entropy and p ranges over all the inputs. This violation of additivity 
was shown first by Hastings [17] for random unitary channels (see also [TS]) and then later 
the violation was proven for general quantum channels [H [T^l [2]. The additivity question of 
minimum output entropy was raised by King and Ruskai in [21j for any pair of quantum channels, 
and it attracted more attention when Shor proved its equivalence to additivity question of Holevo 
capacity in [27] . 

One of the interesting facts which yield this violation is that any conjugate pair $ $ has 
an output with a rather large eigenvalue for a Bell-state input. This phenomenon was pointed 
out first in [18] to disprove the additivity of minimal output p-Renyi entropy for p > 1 (see 
also [3] ) , and the precise limit eigenvalue distribution was calculated in [8j [10] , using random 
matrix theory. A search for the optimal input for this conjugate pair was carried out in [6j[7] and 
it was shown that among some large classes of input states, a Bell state asymptotically gives the 
least output entropy through the random conjugate pair $ ® This of course does not imply 
that a Bell state actually gives the least output entropy but it gives solid mathematical evidence 
for the physical intuition that a Bell state is the optimal input for small output entropy. 
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In this paper, we ask the same kind of question: "what is the optimal input for $® r ?", 
where ^ = $ ® This question is related to the additivity question for r-tensor power of the 
conjugate pair: 

S min (^l = rS min (*) 

which was positively conjectured in [17] : the paper argues an intuition that for low output 
entropy only bipartite entanglement between $ and is useful while multi-partite entanglement 
is not. We enforce this intuition mathematically by showing that among large classes of inputs 
products of Bell states asymptotically give the least output entropy where bipartite spaces for 
these Bell states make pairs of <1> and 

The novelty of our results consists in the fact that we are considering arbitrary tensor powers 
of ^® r , whereas previous work [IH1 E7J IH1 HD1 El E] dealt witn the case r = 1. Although 
some weak form of additivity for $® r , where <3? is the random quantum channel, was shown 
in [23], our results are different in the following sense. We specify the limiting output matrix 
of v &® r for any well-behaved inputs which have a set of stable parameters as the dimension of 
system grows, in order to discuss on optimal inputs. The main result of the paper can be stated 
informally as follows: 

Theorem 1.1. Let <3? ra be sequence of random quantum channels defined by random Haar isome- 
tries and put \P„ = 4> n ® $ n . For any r ^ 1, any sequence of "well-behaved" input states for 
\E f ® r yields an asymptotical output entropy larger than products of r Bell states. In other words, 
"well-behaved" inputs can not violate the additivity relation for 

To prove the above statement, we introduce new techniques to study high tensor powers of 
random matrices. Not surprisingly, we have to deal with operators corresponding to the action 
of the symmetric group on tensor powers of vector spaces, due to the use of the Schur-Weyl 
duality via the Weingarten formula. First, we perform a detailed spectral analysis of these 
operators, needed to state optimality result for the von Neumann entropy. Secondly, in the 
appendix, we compute bounds for generalized traces of arbitrary matrices acting on tensor 
powers, in terms of the L , L? or the L°° norms of the matrices. We believe these two technical 
results to be important on their own and to find applications in future work on the subject. 

The paper is organized as follows. In Section [2] we briefly explain the setting and the tools: 
random quantum channels and graphical Weingarten calculus. Then, in Section [3l we analyze 
some operators defined via permutations, which are used later. After calculating the limiting 
output matrix of \&® r for well-behaved inputs in Section HI we conclude in Section [5] that among 
these, tensor products of Bell states yield the least output entropy. In Section [6] we show that 
GHZ states and generic multi-partite inputs asymptotically give the maximally mixed outputs. 
Finally, in Appendix we obtain bounds for generalized traces of matrices, which are used to 
prove results in Section [H 

2. Random quantum channels and the graphical Weingarten formula 

This section is split into three parts, which treat, in order, random quantum channels, the 
algebraic Weingarten formula, and its graphical implementation. 

Random quantum channels. A quantum channel $ : M n (C) — > M n {C) is a linear, com- 
pletely positive and trace preserving map [26J. By the Stinespring dilation theorem, such a map 
can be written as 



(1) 



=Tr C k(VXV*) 
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where 

(2) V : C n -»• C k ® C n 

is an isometry, V * V = i\ One can define a probability measure on the set of quantum channels 
starting from the Haar measure on the unitary group lA{kn) in the following way. We endow 
the set of isometries C n — > C fc (g> C n by truncating a unitary matrix £7 distributed along the 
Haar measure and we consider the image measure on the set of quantum channels. In other 
words, 

(3) = Ti ck (U(P e ®X)U*), 

where P e 6 M&(C) is a rank one projector, called the state of the environment. 

If one switches the roles of output and environment spaces, we obtain the complementary 
channel $ c : M n (C) -> M fc (C) [El [20]: 

(4) $ C (X) = rr c »(yiy*) or $ C (X) = Tr c »(£/(P e p)?7*) 

A quantum channel and its complementary channel share the same non-zero output eigenvalues 
for any rank one input state, see |19tl2U] for details. We shall prefer working with complementary 
channels because we are interested in the asymptotic regime n — > oo and k fixed. 

Unitary integration. Our random matrices are built from Haar random unitary operators, 
so we are interested in computing averages with respect to this measure. The computation of 
averages of monomials in the entries of a random unitary matrices has been performed in [5[I13|. 

Theorem 2.1. Let n be a positive integer and i = (ii, . . . ,i p ), i' = (i' l7 . . . ,i' p ) , j = (ji, . . . ,j p ), 
j' = (j [ , . . . , j') be p-tuples of positive integers from { 1 , 2, . . . , n} . Then 



(5) / U hjl ---U ipjp U i[fi ---U ilvjlp dU = 

JU{n) 

E 5 nU) ■ ■ ■ V Q(P) ^ (1) ' ' ' S j P J', M W g(«' ^ la ^ 

a,/3&S p 

where the function Wg is called the Weingarten function [T3]. If p ^ p' then 



(6) / U hjl ---U ipjp U i r j{ ---U i , j , dU = 0. 

JU(n) ' ' 

Let us recall the definition of the unitary Weingarten function. 

Definition 2.2. The unitary Weingarten function Wg(n,<r) is a function of a dimension pa- 
rameter n and of a permutation a in the symmetric group S p . It is the inverse of the function 
a i — y ri& a under the convolution for the symmetric group algebra (if a denotes the number of 
cycles of the permutation a): 

(7) Vcr, vr G S p , Yl W s( n > ^" 1 ^)« #(T " 1 " ) = 

t£S p 

It has the following asymptotics 

(8) Wg(n, a) = n- (p+|ff|) (Mob(a) + 0(n" 2 )), 

where the Mobius function is multiplicative on the cycles of a and its value on a r -cycle is 

(-rr^caw, 

where Cat are the Catalan numbers and \a\ is the length of a , i.e. the minimal number of 
transpositions that multiply to a. 
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Finally, note that the function \a\ = p — j^a induces a distance on S p , d(a, ir) = \a 1 ir\. 

Graphical Weingarten calculus. We recall now the graphical Weingarten method for com- 
puting unitary integrals, introduced in [8j. We shall only give the main ideas, referring the reader 
to [8] for all the details. Recent work making heavy use of this technique is [TO ] fTTl [T2], ISj H| 17]. 

The Weingarten graphical calculus builds up on the tensor diagrams introduced by physicists 
and adds to it the ability to perform averages over unitary elements. In the graphical formalism, 
tensors (vectors, linear forms, matrices, etc) are represented by boxes. To each box, one attaches 
labels of different shapes, corresponding to vector spaces. The labels can be empty (white) or 
filled (black), corresponding to spaces or their duals: a (p, q) -tensor will be represented by a 
box with p white labels and q black labels attached. 

Besides boxes, our diagrams contain wires, which connect the labels attached to boxes. Each 
wire corresponds to a tensor contraction between a vector space V and its dual V*: Vx V* — > C. 
A diagram is a collection of such boxes and wires and corresponds to an element in a tensor 
product space, see Figure [TJ 




Figure 1. Diagram for a matrix acting on a tensor product space and for a 
tensor contraction between matrices B and C. 



Let us now describe briefly how one computes expectation values of such diagrams containing 
random Haar unitary boxes. The idea in [8] was to implement in the graphical formalism the 
Weingarten formula in Theorem 12.11 Each pair of permutations (a, (3) in ([5]) will be used to 
eliminate U and U boxes and wires will be added between the white, resp. black, labels of the 
box U with index i and the white, resp. black, labels of the box U with index a(i), resp. (3(i). 
In this way, for each pair of permutations, one obtains a new diagram, called a removal of the 
original diagram. The graphical Weingarten formula is described in the following theorem [8]. 

Theorem 2.3. If D is a diagram containing boxes U, U corresponding to a Haar- distributed 
random unitary matrix U E U{n), the expectation value ofD with respect to U can be decomposed 
as a sum of removal diagrams T) a ^, weighted by Weingarten functions: 

E V (V) = ^P a , /3 Wg(n,a- 1 /3). 

<x,P 

3. Partial permutations and their associated operators 

In this section we define partial permutation matrices, central objects which will appear 
several times in the paper. Then, we construct several matrices out of partial permutation 
matrices, which are used later in this paper. Readers may just look over this section to get 
familiar with newly defined matrices and come back to it whenever needed. 

For a permutation a £ S r , we define P® £ M n r(C) the tensor permutation matrix on simple 
tensors by 

(9) VxiGC", P® (xi <g> • • • ®x r ) = x CT -i ( i) (8) ••• (gia^-i^). 

These operators correspond to the usual action S r r\ (C n )® r . Note that we use the tensor 
superscript to distinguish these operators from the usual permutation matrices P a which act as 
P a ei = e<j- on a basis {e±, . . . , e n } of C n . 
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A partial permutation on {1, 2, . . . , r} is denned to be an injective map 

a : dom(a) — > {1, 2, . . . , r}, 

where dom(a) stands for the domain of a. The set of all partial permutations of {1, 2, . . . , r} is 
denoted by S r . We denote by the partial permutation with empty domain. The cardinality 
of S r is 



(10) |S r | = £ 



2 

k\ 



k=0 

We endow the set S r with the "map extension" partial order: a ^ f3 iff 

dom(a) C dom(/3) and a{i) = /3(i)Vi G dom(o). 

For this partial order, the empty partial permutation is the unique minimal element and every 
"full" permutation a G S r C S T is a maximal element. 

Suppose we have a 2r-partite space (C n )® r (g) (C n )® r and name the spaces as follows: 

(C n )® r ® (C n )® r = Cp i)T] Cf 2jT] ® • • • ® Cf r>T] 8) C|i )B] 8) Cf 2iB] ® ■ ■ ■ <g> C£ jB] , 

where T stands for "top" and -B stands for "bottom" (the spaces should be imagined stacked 
vertically, see Figure [2] for the case r = 2). For a partial permutation a G S r define 

(11) 4 n) = ® ® 7 

a;Gdom(a) 

where G CfJ, T i ® B j is an un-normalized Bell state: 

n 

(12) b<$ = ® /*/; 

where {e^} and {/?} are the canonical bases for the spaces with labels [x,T] and [y,B], respec- 
tively. In other words, 

i) If x G dom(a), then acts on CjJ, T j (8) « as the un-normalized Bell state; 

ii) If x g" dom(a), then Ta acts on T j as the identity; 

hi) If y ^ q(x) for any x G dom(o), then acts on ™ as the identity. 

In Figure [21 we use the graphical notation from Section [2] to represent the seven different 
operators in the case r = 2 (note that 7 = 1 + 4 + 2 as in (jlUp). 

o • ex /• a /• o • o • ck m ck a 

o • cm (-m cA A» cx cx cO Cm oA A-« 

o • c/ ^« o-/ — U» c/ ^» cA — h» oO Q« ay 

o • o • a o • or c/ c/ ^» 

Figure 2. Diagrams for the operators Ta in the case r = 2 

We also define the normalized versions of the above operators: 

(13) £W = ii?W; i*»>= (J?) Bi®7, 

which are projections. 



^.s/ n ,y ' x,a(x) 

xSdom(d) 
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In the rest of this section, we compute (asymptotically) the spectrum of an operator appearing 
in the study of the output states of Theorem 14.31 The main result here will be used in Section 
[5] to analyze the optimality of input states for random quantum channels. 

For a partial permutation a G S r , define the operator 

(]_4) Q( n ) = /p(")(_-Qldom/31-|domq| 

Our goal (see Proposition 15. ip is to show that the matrices Qa are asymptotically positive 
(i.e. their eigenvalues converge to non-negative numbers) when n — > oo. We shall do more 
than this, by showing that their asymptotic spectrum is the set {0,1}. First, note that any 
partial permutation j3 satisfying (3 ^ a can be written as a "direct sum" j3 = a © 7, where 7 
can be any partial permutations such that the domains and the images of a and 7 are disjoint: 
dom(a) n dom(7) = and a(dom(a)) n 7(dom(7)) = 0. Using this observation, one can write 



(15) Ql 



(») 



xgdom(a) 



where Qq is a copy of Qq with r replaced by r = r — |dom(a)|. From this decomposition, we 

(n) (n) 

see that the operator plays a special role, as it appears in every other Q a . Moreover, it 
is clear that is asymptotically positive iff is. 

Since we expect the operator G M n 2r(C) to have eigenvalues with high multiplicity, we 

study it as a left multiplication operator on an algebra A n 3 1. Besides the operators T a n ^ (which 
connect "top" spaces with "bottom" spaces), the algebra A n contains all tensor permutation 
operators acting separately on the top and the bottom spaces. Formally, the A n is defined as the 
algebra generated inside M n 2r (C) by the operators T a n ' and the tensor permutation operators 
of the form P® ® P® B , for any permutations ttt,b G S r : 

A n = alg ({T^} aeSr U {P® T ® Pf B U >B eS, 

Up to normalization constants (depending on n), the algebra A n has a combinatorial struc- 
ture, induced by the wire contractions. It can be easily seen that the product T^Tg 1 can be 

written as cT-y ] P® T <8> P® B for some partial permutation 7 G S r , permutations 7r*r, ttb G S r and 
a constant c > (actually, c is a positive power of n). We introduce next the following subsets: 

• X 1 = {Ti n) :a£S r } 

• X2 = {P® T ® P® B ■ ttt, G S r } \ X\\ permutation matrices not of the above type; 

• X 3 = {ri n) [P® T P® B ] : a G S r and vr r , vr B G S r } \ (X x U X 2 ): products not of the above 
types. 

The set X = X\ U X2 U X3 admits the following elegant description. 

Lemma 3.1. The set X is in bijection with the set of permutations o/2r objects. More precisely, 
the map 

Sir % 
a h-> (id ® t)P® 

is a bijection, where t denotes the transposition operator on M n r(C) acting on the "bottom" 
subspaces. 
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(a) (b) (c) 



Figure 3. Graphical representation of operators from X: (a) the labeling of the 
spaces, (b) possible (blue) and impossible (red, dashed) wires for an element of 
X, (c) possible and impossible wires for an element of (id ® t)X. 

Proof. Graphically, elements in X are collection of wires (tensor contractions) connecting 4r 
points (vector spaces): on the left hand side [x,T/B]l (corresponding to primal spaces) and 
on the right hand side [y,T/B]n (corresponding to dual spaces), see Figure By definition, 
X contains all such diagrams, with the exception of the ones connecting some [x, T]l to some 
[y, B]ji, or some [x,B]l to some [y,T]n. The partial transposition operation has the following 
effect on the diagram of an element of X: it swaps the bottom left and right points, i.e. 
[x, B]l [x, B]r. Hence, (id<g>t)X contains all wire diagrams, such that all wires are "crossing" , 
i.e. points on the left are connected to points on the right. But these are exactly the diagrams 
for tensor permutations of S 2r which concludes the proof. □ 

Lemma 3.2. The set X is a basis for the algebra A n , for all n ^ 2r. 

Proof. It is clear from the definitions that the elements in X span the algebra A n = span c A\ 
Using Lemma 13,14 we need to show that the family of tensor permutations matrices {P®} a £g 2r 
is linearly independent in M n 2r(C). To this end, consider a linear combination 

E a ° p ° = 

Fix some permutation tt G S 2r - Since n ^ 2r, we can pick an orthonormal family {ei, . . . ,e 2r } 
in C n . Let x = e^^i) <8> e n ( 2 ) ® • • • <8> e 7r ( 2r ) and y = e\ <g> e 2 <8> • • • ® e2 T - It is easy to check that 
(y, P®x) = 5 n ^ a , hence = 0. This shows that the linear combination must be trivial and thus 
the tensor permutation matrices are linearly independent. □ 

Remark 3.3. Notice that usual permutation matrices are not linear independent, as one can 
see from the following relation holding in S^: P(i2) + -^(34) = ^*(12)(34) + ^id- 

Write Q = (see (H3D) and define the left multiplication operator on A n as follows: 

Lq : A n — > A n 
A^QA 

Lemma 3.4. One has the following inclusion of spectra 

spec(Q) C spec(LQ) 

Proof. Since Q is Hermitian, write 

k 

Q = Y, XiPi 

1=1 



s 
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to be the spectral decomposition of Q, where Aj G M. and Pi are projections on the eigenspaces. 
Then, 

(Aj id -Lq) Pi = (XiI-Q)Pi = Q 

On the other hand, 

Pi e alg{Q} C An 

Hence, A, id— Lq is not invertible on A n and thus Aj 6 spec(Lg). □ 

The algebra A n has a combinatorial nature, and dim(„4 n ) = (2r)!. Notice that the dimension 
of A n does not grow with n, fact which is crucial for the analysis that follows. The previous 
lemma relates the spectrum of the operator Q = (a high dimensional object) to the 

spectrum of Lq which is of bounded dimension. We now state the main result of this section. 

Theorem 3.5. For all r ^ 1, the spectrum of the matrix Q = is at a distance 0(l/n) 
from the set {0, 1}. 

Proof. By the inclusion of spectra proved in Lemma 13.41 it suffices to show the conclusion for 
the operator Lq. By doing this, we are working with a matrix of fixed size, the dependence 
in n appearing in the entries of Lq. For computational simplicity, we rescale the basis X into 
X, in the sense of (|13p . so that all the elements in X are projections, up to multiplication by 
P nT P-kb- Based on Lemma 13.21 we write the operator Lq in the basis X. 





1 


2 


CO 


h- > 




L\fl 


£1,3 


2 


-^2,1 


-^2,2 


£2,3 


CO 


V L 3,1 


-^3,2 


£3,3 



The indices above indicate which spaces are associated; i corresponds to the space spanned by 
X{. First, we analyze the blocks -^2,1 and L31. We claim that 

rp( n )f(n) _ if a V /3 exists in the poset (S r , ^) 

a 13 l0(n _1 ) otherwise 

Here, we understand 0(n _1 ) to be an element of norm 0(n~ 1 ) from A n - Indeed, one has that 



I • B 



B x ,y <8> I if x = z and y = w 

B x ,y <8> B z ^ w g) I if x 7^ z and y / w 
Ofn -1 ) otherwise 



This follows from the fact that in order to obtain non-vanishing elements when concatenating 
B X)V and B zw , the half loops have to meet either the identity matrix or the exact same half 
loop. Taking products of the previous relations yields (|16p . Then, 

(17) Qf^ = Q^fjP* = ^(_l)|d°m(a)|^n)^n) 

a 

_ ^ ^_-Q|dom(ai)|+|dom(a<2)| j^n) j>H _|_ Q^ n ~^ 

dom(a 2 )ndom(/3)=0 
a2(dom(a 2 ))n/3(dom( / 9))=0 

_{Q + 0{n- 1 ) if/3 = id 
I 0(n _1 ) otherwise 
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In the second equality, we gather all the vanishing terms in the 0(n 1 ) such that the surviving 
partial permutations a are of the form a\ © cti- For the last equality, we used: 



This implies that 
£1.1 



/I 

±1 



£ (_l)l dom («i)l = (1 - l)|dom(/3)| = ^ 
\ 



0(n 



L %1 = 0(n- x ); L, 1 = 0{rT 1 ) 



J 



The signs ±1 appearing in the first column of depend on the order of basis matrices and 
we do not require their exact values. Note also that one can show that 1/2,1 = 0, but we will 
not need this result in what follows. 



~(n) 

Next, we study £1,2, -^2,2 and £3,2- Since T = 7 G" X2, we know that T a acts on X2 as 

j.(n)p5 



7T7 1 



7T^ 



P® if a 



€ span(Af 3 ) otherwise 



Hence, QP® © P« 



^ 5 

Li, 2 = 
Finally, using (fl~6|) . we have 



T( j P® B + X 3 , with X 3 G span(A'3). Thus, 



L 



2,2 = I; L 3 2 = (unspecified) 



p(n) 

-aV/3 



if a V /3 exists in the poset (S r , ^) 



(18) Wl*^ othemse 

Since 7 Af 3 we have /3 7^ and then it follows from (|17p that 

£1,3 = Oin- 1 ); L 2 , 3 = 0(0; L 3 , 3 = O^ 1 ) 

Therefore, as n — > 00, the matrix representation of Lq converges to a lower semi-triangular 
matrix whose diagonal elements are or 1, proving the theorem. □ 

As a corollary of the theorem, we obtain the asymptotic positivity of the matrices Q a ■ 

(n) 

Corollary 3.6. For all r ^ 1 and a G S r , the spectrum of the matrix Q a is at a distance 
0(1 /n) from the set {0, 1}. 



Proof. This follows from equation (|15p . Theorem 
equation ([15]) is a rank-one projector. 



and the fact that first tensor factor in 

□ 



4. Output states for product of conjugate random quantum channels 

This section contains the main probabilistic result of the paper, a limit theorem for a model 
of random matrices. We investigate output quantum states of a product of r identical random 
quantum channels and their complex conjugates: 

(19) *® r = $® r 

Here, *f? n = <!>„ ® <£ n where $ n are random quantum channels, as defined in Section [21 Given a 
sequence of input states ip n G C 2rn , we are interested in the sequence of output states 



(20) 
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where we consider complementary channels . Note that, as it was noted in Section[2l replacing 
a channel by its complementary does not change the (non-zero) spectrum of the outputs for 
pure inputs. This is important for us given the asymptotic regime we are interested in (k fixed 
and n — > oo), since the output matrix Z n lives in the fixed space M fc 2r(C). 

In the graphical calculus, the sequence of vectors ip n on C 2rn is represented as the rn x rn 
matrix A n through usual isomorphism between C 2rn and M rn (C), see Figure H] for the case 
r = 2. 




Figure 4. Diagram for the output matrix Z n in the case r = 2. 

We are interested in computing the asymptotic moments ETrZn of the output matrix Z n . 
To do this, we are going to use the graphical Weingarten calculus, Theorem 12.31 The diagram 
for Tr Zn can be thought of containing p copies of the diagram in Figure H] connected in a tracial 
manner. This diagram contains 2rp boxes U, which shall be labeled by triples [i,x,P], where 

• i 6 {1, . . . ,p} indicates the index of the copy of Z n the U box belongs to; 

• x & {1, . . . , r} denotes the index of the channel $ or 

• P G {T, B} indicates whether the box U belongs to a $ channel (P = T) or to a $ 
channel (P = B). 

Let 5 6 S2r P be the permutation that swaps top and bottom indices 

p r 

S = l[l[([i,x,T] [i,x,B]) 

1 = 1 x = l 

Also, define 7 £ S2 rp as the permutation that encodes the trace of the product of the Z n 
matrices, see Figure [5j 

r r 

j=l[([r,x,T) [r-l,x,T]---[l,x,T])H([l,y,B] [2, y, B] ■ ■ ■ [r, y, B\) 

x=l y=l 

Then, we have the following formula for the moments of the random matrix Z n : 
Theorem 4.1. For a given sequence of inputs ip n 

(21) ETrZ£ = (l + 0(n- 2 )) £ f[ [^„|f£% n > H" 1 ) l dom ^)H dom ^)l 



=1 
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-3 



[1,1,71 [2,1,21 \PA,T] 




-3 



[l,r,T] [2,r,T] \p,r,T] 




[1,1,5] [2,1,5] [p,l,B] 



[l,r,B] [2,r,B] \p,r,B] 

Figure 5. A representation of the permutation 7 encoding the tracial structure 
of the moments. Note that in the top r rows, 7 acts like a decreasing cycle, 
because the unitary operator U appears on the left hand side of the channel 
so it is connected to the U operator appearing in the previous block. In the 
bottom rows, the situation is reversed for the channel <£. 

Here, the operators To 1 where defined in (|13|) . cti, /?j 6 S r are partial permutations, and a 6 S2rp 
is a permutation induced from the cti 's in the following way: 

p 

(22) a = H [] ([i,x,T},[i, ai (x),B}) 

i=l iGdom(ai) 

Proof. First, the graphical Weingarten formula from Theorem 12.31 allows us to write ETr[Z^] 
as a sum over diagrams indexed by a pair of permutations (a, /?) G S2 rp - For such a pair, the 
corresponding diagram will contain (see Figure for the wiring of a box U appearing on the 
top): 




Figure 6. Wires attached to a U-box appearing on top (<3? channels) before and 
after the graph expansion are represented in (a), respectively (b). 



(1) #a loops corresponding to C n , which give a contribution of n^ a ; 

(2) #(a _1 7) loops corresponding to C k , which give a contribution of k^ a 

(3) necklace diagrams containing wires and A n boxes, which give a contribution of /a„(/3). 
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It follows that 

(23) ETr[Z%]= £ n* a k*^-^ f An (P) Wg(a" 1 /3) 

<X,/3ES2rp 

The contribution of ^-necklaces f An (/3) can be expressed using the formalism of generalized 
traces introduced in Appendix^! see Figure [7] for a graphical proof of this fact: 

(24) f An (/3) = TVi* (Ai, Al, ■ ■ ■ , A, A) 



Figure 7. The contribution of the necklaces containing the matrices A n and A* n 
is given by the generalized trace of these matrices with respect to the permutation 
p~ x S. 

Note that, in order to have the matrices A and A* n appearing in the right order, we have to 
order the triplets [•, ■, •] as follows: [i, x, P] < [j, y, Q] if and only if i < j or (i = j, P = T and 
Q = B) or (i = j , P = Q and x < y). One can use Lemma fA. II to bound the generalized trace 

(25) f An ((3) ^ n d[st ^~ 15 ^ = n^" 1 ' 65 ) 

To obtain this bound, we used the normalization condition \\ip n \\ = HAH 2 = 1> Recall that the 
set of permutations is defined to be 

6= \J {9 G S 2pr : Ve G £Vx G {1,... ,r}, 9{[e,x}) = [/,*] for some / G F 

EUF={l,...,p}x{T,B} 
\E\ = \F\=p 

and V/ G F Vx G {1, . . . , r}, #([/, x]) = [e, *] for some e G E} 

Hence, using j^a = 2rp — \a\ and the Weingarten asymptotic Wg(a -1 /3) 
have 

(26) (The power of ninETr[Z£]) < 2rp - \a\ + dist(/3~\ 05) - 2rp - |a _1 /3| 

(27) < min \/396\ - 1/31 < min \95\ = 

see eee 

Here, we have used the triangle inequality twice \a\ -(- 1 q" 1 /3 1 ^ |/3|, |/30<5| — 1/3| ^ \95\ and the fact 
that 5 G 0, which is obvious from the definition of O, with the choice I? = Ex = {1, ...,/?} x {T} 
audF = F B = {l,... J p}x{B}. 

In order to find the contributing pairs (a, /3), we have to investigate the equality cases in the 
above inequalities. For the first triangle inequality \a\ + |a _1 /3| ^ |/3|, the bound is saturated 
if and only if a is on the geodesies between id and ft, denoted by id — > a — > ft. For the second 
inequality min^ge \/30S\ — \ft\ ^ 0, the equality case reads 

(28) mm\ft95\ = \ft\ 

To conclude, we aim at finding all pairs (a, ft) such that 

(29) id -> a -> ft 

(30) \P06\ > \ft\ for all 9 G 
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Consider a permutation a = ot © 0\b G ^2rp where cft,b leave invariant the top and bottom 
elements, i.e. 

(31) <7 T ({l,...,p} x {l,...,r} x {T}) = {1, . . . ,p} x {1, . . . , r} x {T} 

(32) <t b ({1, . . . ,p} x {1, . . . ,r} x {£?}) = {1, . . . ,p} x {1, . . . , r} x {B} 

Then, one can check that a5 G O, for the choice E = Et, F = Fb- In other words, for all 
otiOBi we have ot ® &b G @<5- In particular, 

(33) Vi,j,x,y, ([i,x,T], \j,y,T]), ([i,x,B], \j,y,B]) G @S 

Note that we do not exclude the case i = j or x = y. 

Consider now a transposition r = ([i, x, T], [j, y, B]) with i ^ j. We have 

TS = ([i,x,B],\j,y,B},\j,y,T],[i,x,T]) [] ([I, z, T), [I, z, B]) 

l,z 

(l,z)i{{i,x),(j,y)} 

One can see that t5 G G for the choice 

(34) E = {l,...,p}x{T}\{(j,T)}U{(j,B)} 

(35) F = {1, . . . ,p} x {B} \ {(j, B)} U {(j,T)} 

We have thus shown that the following transpositions belong to Q5: 

• {[i,x,T],[j,y,T]),([i,x,B},[j,y,B]), for alH, j, x, y; 

• ([i, x, T], [j, y, B]) for all i, j, x, y such that i ^ j. 

On the other hand, it follows from |25l Lemma 23.10] that \a-(i,j)\ = \cr\ — 1 for any permutation 
<j where i,j belong to the same cycle. Hence, in order for (j30|) to be satisfied, we see that the 
following pair of elements can not belong to the same cycle of (3: 

• [i,x,T] and \j,y,T], for all i,j,x,y; 

• [i,x,B] and [j,y,B], for all i,j,x,y; 

• [i,x,T] and [j, y,B] for all i,j,x,y such that i ^ j. 

Note that (3 can not have cycles of length larger than 2, since in that case at least two of the 
elements in the cycle would have the same top or bottom index, contradicting one of the first 
two conditions above. It follows that /3 is a product of disjoint transpositions (swapping top 
and bottom elements). The final condition above implies that these transpositions should swap 
elements belonging to the same "i" group. Therefore, the equation (f3"0"j) finally implies that j3 
should be of the form 

v 

(36) /3 = [] H ([i,x,T\,[i,Pi(x),B]) 

i=l xGdom(/3i) 

where (3i G S r for i = 1, . . . ,p. 

Having solved the equation ([HUD, we move on to finding a satisfying (f2Uj) . Since /3 is a product 
of disjoint transpositions (|36|) . a should be constructed from a subset of the transpositions 
appearing in /3. It follows that 

v 

(37) a = ]J ]~] ([i,x,T},[ i,ati(x),B}) 

i=l x£dom(cti) 
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for partial permutations ai S S r satisfying on ^ Plugging the values for a and (3 in equation 
(1231). we obtain 



(38) ETrZ? 



(i + o ( n- 2 )) £ fe #(a " l7) , di {f ( ;^ ^ Ha ' 1/3| (-i) |a ' 1 

t*,/3 as in J57t.i55t 



The error term l + 0(n -2 ) comes from the Weingarten asymptotic ([8]) and from the fact that the 
expression 2rp — \a\ + dist(/3 _1 , 0<5) — 2rp — |a _1 /3| appearing in (f26|) has a constant parity as a 
function of (a, (5). Indeed, this follows from the fact that the \o~t\ = \o~\ ± 1 for any permutation 
a and transposition r. Since both a and (3 appear twice in the expression we are investigating, 
the parity conservation property follows. 

We are going to further simplify the formula for ETrZ„ by using the fact that |ce _1 /3| = 
Y2i |dom(/3j)| — |dom(aj)| and, for /3 as in (|36|) . 



/A„(/3)=n^i r ft ) i^ 



i=l 



Furthermore, note that for such /3, dist(/3 _1 , 9(5) = Y%=i |dom(/3j)|, since with non trivial partial 
permutations /3j, (35 is not an element of 0, so in order to correct j3 in such a way that f35 £ 0, 
one has to undo the wires appearing in dom(/3j), for all i. With all the above ingredients, we 
obtain the announced formula for the p-th. moment of the output matrix Z n : 



(39) ETr Z% 



;i + o(n- 2 )) ^ |a " l7l f[ 



i=l 



n |dom(ft)l 



ix |dom(ft)|-[dom(a;)| 



□ 



Since the matrix Z n is living in a space of fixed dimension, convergence of moments can be 
easily translated into the convergence of the random matrix itself. We define for (3 € S r 



(40) 



R 



(k) 



(g) \B^ {x) -k^I]®I 

x-£dom(/3) 



E 



D 



(k) 

x,f3(x) 



f_ k -U\dam(fi)\-\A\j 



ACdom(/3) \xeA 
V" • (-fc- 1 )l dom (0)H dom («)l 



where the second equality follows from the binomial formula. 

Theorem 4.2. Consider a sequence of input states ip n . Then Z n has the following expectation: 
(41) EZ n = (1 + 0(n- 2 ))k- 2r £ (^nlf^Vn)^, 



where the 0{n 2 ) error appears in each entry of the matrices and is defined in (|40p . 

Proof. As in the previous theorem, we shall use the graphical Weingarten calculus, with the 
major difference that this time, we are averaging operators, and not scalars. Moreover, we are 
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in the simplest possible case, p = 1. By replacing k&( a ^ by the k 2r x k 2r matrices Ta (see 
(fTT|) ) in equation ([23]) . we obtain 



(42) E2 n = ^ n^/^GSjWgfa-^.rW 

Since A; is fixed, the terms which survive asymptotically are the same as before, so we conclude 

= (i + o{n- 2 ))k-^ y, mf^^Rf 

□ 

In order to be able to state almost-sure convergence results for the random matrix Z n , we 
have to make the following assumption on the behavior of the input sequence ip n . 

Assumption on input vectors: a sequence of vectors {ip n )n is called well behaved if for all 
partial permutations (3 6 <S r , one has 

(43) lim <^ n |T<% n ) = a p G [0,1]. 

tj. — ^nn i 



The set of numbers { a p)p £ g will be treated as parameters in our model from now on. Obvi- 
ously, one has the normalization condition O0 = 1. The following theorem is the main result of 
this section. 

Theorem 4.3. Given a sequence of well-behaved inputs, the output matrices Z n converge almost 
surely to 

(44) Z = k- 2r £ a P R f 

Proof. We shall use the Hilbert-Schmidt (or the 2-Schatten) norm to prove the convergence. 
Note however that all norms are equivalent on the finite-dimensional space M k 2r{C). 
It suffices to show that 



(45) j^E[|Z„-EZ„||l < oo 

71=1 

and then apply the Borel-Cantelli lemma to conclude that \\Z n — E Z n ||a converges to zero almost 
surely. We have 

E \\Z n - E Z n \\ 2 2 = ETr(Z^) - Tr[(E Z n ) 2 ] 

The first term above has been computed in Theorem 14. li for p = 2 (recall that the permuta- 
tion a is defined in 

(46) ETrZ2 = (l + 0(n- 2 )) £ f[ [<tf„|r£^ n > (-AT 1 ) l dom (ft)H dom ^)l 

ai<j8i i=l 

"2^/32 
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The second term can be easily computed from Theorem 14.21 



Tr(E Z n f = Tr 



-k 



_]\ |dom(/3)| — |dom(a) 



T (k) 



i + o(n- 2 )) Y ^Tr^rWjn^nir^Vn)!-^ 1 ) 



ai</3i 



|dom(/3i)|-|dom(ai)| 



i=l 



Note that one can compute 

(47) k~ 4r Tr [T ai T a2 ] = k- Ar+#{ - a ~^ = k'^' 1 ^ 

where a £ S^ r is defined as before. We thus get the estimate which allows us to conlcude: 

(48) E \\Z n -EZ n \\ 2 = ETr(Z 2 ) - Tr[(EZ„) 2 ] = 0(n~ 2 ) 

5. Optimality of products of Bell states 



□ 



In this section, we show that among the well-behaved input states satisfying assumption (|43p . 
the ones having minimal output entropy for generic random channels are tensor products of Bell 
(or maximally entangled) states. In particular, we recover results from [6] in the case r = 1. 

First, the conclusion of our main result from Section U Theorem 14. 3\ can be reformulated as 
follows. 

Proposition 5.1. For a fixed sequence of well-behaved input states, the output matrix Z n con- 
verges almost surely to the matrix 



(49) 



a£<S r 



where p a are positive numbers that sum up to one and 



x,a(x) 



<g> I 



(so) z^ = g) a 

lEGdom(ct) 

Here, I is the identity operator normalized to have unit trace and 

C x , y = k~ 2 Bi k l + (AT 2 - k- 3 )I 



Proof. One can rewrite Z^ as 



zf = k- 2r 



B 



(k) 

x,/3(x) 



AT 1 /) + 1 



icGdom(/3) 



Here, the last equality comes from ([40 p . Then, using the Mobius inversion formula for the poset 
S r , we can express the R matrices in terms of the Z matrices: 
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Plugging this expression into the limiting matrix formula in (|44p . we obtain 

Z = ^(.^Idom^l-ldomMI^^) = p a ZW, 

aeS r 

where 

Pa = ^(_l)|dom(/3)|-|dom(a)| a ^ 

All there is left to show is the positivity of the real numbers p a . Using the definition (|43p of 
the coefficients ap, we write 



P. 



a 



n— >oo 

where 



lim (lf> n \Q^ n ), 



Q( n ) ^ V |dom/3| — domalyiC") 



L /3 

Since it was shown in Corollary 13.61 that the operators are asymptotically positive, the 
positivity of the pp follows and the proof is complete. 

□ 

The entropy of the density matrices Z& are easily computed (C is any of the matrices C Xjy ): 

(51) H(Z^) = |dom(a)|F(C) + (r - |dom(a)|) log(A; 2 ), 
Note that H(C) is a constant depending on k, 

(52) H{C) = -(k- 1 + k~ 2 - AT 3 ) log(fc _1 + kT 2 - k~ 3 ) - (k 2 - l)(k- 2 - AT 3 ) log{k- 2 - AT 3 ) 

(53) = h(k~ l + kT 2 - AT 3 ) + (k 2 - l)h{k~ 2 - k~ 3 ) 

(54) < log(A; 2 ), 

where h{x) = — x\ogx. It follows that the entropy of is a strictly decreasing function of 
|dom(a)|. We now state the main result of the current section. 

Theorem 5.2. Among all sequences of well-behaved input states, the ones having a minimal 
output entropy are the ones having parameters 

o-p = lp^w, 

for some (full) permutation tt G S r . In this case, the input state is (asymptotically) a tensor 
product of Bell states (where the matching 3> •<->• $ of the conjugate channels is given by it), the 
output state is Z = Z^ and the output entropy is 

H(Z) = rH(C) 

Proof. Using the concavity of the von Neumann entropy |26} 11.3.5] and equation (|49p . one has 

H(Z) > £ Pa H(Z^). 

The conclusion follows from the fact that the terms with full a have the least entropy. The 
unicity of the minimizer comes from the strict concavity of the entropy and form the fact that 

(k) 

the density matrices Za are different. The rest of the statements in the theorem are trivial. □ 
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6. High entropy outputs and GHZ inputs 

In this section we discuss a class of input states which give maximally mixed outputs, from 
which no information can be extracted. Although such examples are not interesting for the 
purpose of communicating classical information, they have theoretical interest. After stating 
the main result in the following proposition, we discuss the particular cases of GHZ states and 
generic multi-partite pure states. 

Proposition 6.1. Consider a family ip n of normalized input states such that, for all j3 G S r , 

^ n \ff\^ n ) =o(l). 
Then, the output state is asymptotically maximally mixed, i.e. 

a.s., lim Z n = 

n— too k 

Proof. From Theorem 14.31 the output state is asymptotically a mixture of the operators Rp, 

~ (n) 

with coefficients given by the overlaps between the input vector ip n and the operators . 
Our assumption implies that all these coefficients will vanish asymptotically, except for the one 
corresponding to ft = 0. Hence, the almost sure limit of the output state Z n is 

as announced. □ 



We now analyze the particular case of a GHZ input state [TB]. Such a state is defined by 

1 n 

(55) (C n ) 2r 3 iP GKZ = ^yiei®ei®---®ei. 



n 

i=l 



2r factors 




Figure 8. Graphical representation of the GHZ input state. 

For a graphical picture of such a state, see Figure [HJ Either by direct algebraic calculation of 
from graphical considerations, it is easy to see that one has, for all (3 G S r , 

(^GHzIT^Vghz) =n" dom (« 

The GHZ state satisfies thus the hypothesis of Proposition 16.11 and we obtain the following 
corollary. 

Corollary 6.2. The output of the GHZ state ipGHZ through a product of r quantum channels 
and their conjugates 

z n = [^ r ®^ r ]^ GHZ r GH z) 

converges, almost surely, to a maximally mixed state. 

We study now random pure states ip n distributed uniformly on the unit sphere of (C n )® 2r . 
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Proposition 6.3. For any partial permutation /3 ^ 0, with overwhelming probability, a random 
pure input state ip n satisfies 

<vg^ n Vn) = o(i) 

Proof. First, note that 

n^n\f^ n) \ip n ) = Tr(f( n) E = n~ 2r Tr(^ n) ) = n~ 2dom ^ 

We shall use (see [22]) 

Lemma 6.4 (Levy's lemma). Let f : S^" 1 — > M be a function defined on the unit sphere ofM. d 
with Lipschitz constant L. Then 

¥(\f-Ef\>e)^exp(-Cde 2 /L 2 ) 

where the expectation is taken with respect to the uniform measure on S^ -1 and C is a constant. 

with d = 2n 2r and /(V) = (Vn|^ n) |V? n ). Since fj^ is a projector, the function / has Lipschitz 
constant bounded by 2. Putting e = n~ r+s , for some 5 > 0, we obtain the announced result. □ 

When contrasting the above results with the one in Theorem 15.21 one concludes that the 
entanglement present in GHZ or generic states is not suitable for producing low-entropy outputs. 
The structure of the entanglement in the states from Theorem 15.21 seems to be essential in 
obtaining such low-entropy states. 
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Appendix A. Bounds for generalized traces of matrices acting on tensor 

PRODUCTS 

In this appendix we derive bounds for generalized traces of tensors in terms of their Schatten 
norms. These results are in the spirit of those obtained by Mingo and Speicher in [23], with 
two notable differences. We consider general tensors, whereas in [23], the authors investigate 
generalized traces of matrices. On the other hand, the type of traces we look at are less general 
than the ones in [23]. 

In the current paper, we shall only use the L 2 incarnation of the results presented in this 
appendix. However, we think that the results are interesting on their own and might prove to 
be useful in other circumstances. 

Consider a set of k matrices A±, . . . , acting on (C n )® r . Graphically, these matrices are 
represented by boxes with r legs on each side. Let a 6 S kr be a permutation that will be used 
to contract the 2kr legs of the boxes Ai. We shall be implicitly using the bijection {1, . . . , kr} = 
{1, . . . , k} x {1, . . . , r} in such a way that elements in {1, . . . , kr} shall be denoted by pairs 
(i, x), with t'G {l,...,k} and x £ {1, . . . , r}. We call a generalized trace of these matrices the 
quantity (see Figure M for a graphical representation) : 

k 

(56) Tr a {Ai, . . . , A k ) = Tr [ 6Z)A, | P„ 
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trace 



Figure 9. Diagram for generalized trace of the matrices A\, ... , connected 
with the permutation a. The dotted vertical lines on each side of the diagram 
are identified and correspond to the trace in equation ()56|) . 



As a working example, let us consider the following example of a generalized trace: 
(57) Tr{[Tr(8)id](^a) • [id ® Tr](A 2 )} 



In Figure 10(a) , we represent the trace using the graphical formalism from Section [2J The same 
calculus can be represented as a trace of the tensor product of the matrices against a tensor 
permutation matrix, see Figure 10(b)[ The permutation appearing in this example is simply 
the transposition ((1, 2), (2, 1)), swapping the second leg of A% with the first leg of A2. 



(a) 





(c) 



Figure 10. The same diagram, represented in the usual graphical formalism 
(left), as a generalized trace formalism (center) and in the input-output formal- 
ism (right). The two dotted lines in each picture are identified and correspond 
to traces. The input-output presentation on the right has overhead s = 1. 

Before we state and prove the main result, let us introduce two subsets T and O of the 
permutation group Sk r which correspond to two special classes of generalized traces. 

First, the subset T is defined such that the generalized trace (|56p can be written as trace 
of the product of k factors of the type AiP® for pi G S r , where the order does not matter. 
Graphically, T corresponds to the generalized trace which can be rearranged into a "stream" of 
matrices A^s with wires connecting the neighboring matrices. So, this may be called "input- 
output representation". Formally, for a permutation a G Sk, let 

S kr 5 [a] = {/? G S kr I Mi G {1,. . . , k},\/x G {1, . . . ,r},3y G {1, . . . , r} s.t. fi(i,x) = (a{i),y)} 

be the set of permutations of {1, . . . , kr} which preserve the blocks of size r and act like a 
globally on these blocks. Then, 

r = UN 

#7=1 
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Equivalently, recall that the wreath product S r I S k has elements (f3;a\, . . . ,a k ) with (3 G 5/% 
and aj G S r . One has then 

T = {(p;a 1 ,...,a k )€S r lS k , | #/3 = 1} 

Secondly, when /c is even we define the subset to be such that the generalized trace ([56]) 
can be written as trace of product of "rotated two columns" where each column is a tensor 
product of k/2 matrices A{. More precisely, the two-column structure of is encoded into an 
equi-partition {1, . . . , k} = E U F, with \E\ = \F\ = k/2 and we ask for a permutation G O to 
satisfy 

\/i G E, \/x G {1, . . . , r}, 9{{i,x)) = (j,y), for some j G F and y G {1, ... , r} 
Vj G i 7 , Vx G {1, . . . , r}, 6*((j, x)) = («, y), for some i £ E and y G {1, . . . , r} 

Theorem A.l. For any permutation a G S kr and any matrices A\, . . . , A k G M n r(C), t/ie 
following bounds hold. 

(a) L 1 -bound: 

k 

(58) \Tv a (A l ,...,A h )\^l[\\A i \\ 1 

i=l 

(b) L°° -bound: 

k 

(59) \Tr (7 (A 1 ,...,A k )\^n r+dist ^Y[ {{A^ 

i=i 

(c) L 2 -bound: if k is even, then 

k 

(60) ITiv^x, . . . ,40| ^ n dist ^°) J] [l^lla 

i=i 

Proof. The L 1 case follows trivially from the fact that the Schatten 1-norm (and all the other 
p-norms, for that matter) are unitarily invariant and multiplicative under tensor products. See 
Figure [H Note that equality can be achieved in this case by taking A$ = ®j=i xx * f° r the same 
unit vector x G C n . 

Let us now treat the L°° bound, which is conceptually more interesting, and discuss the 1? 
case last. To this end, we introduce the important definition of an input-output presentation of 
the generalized trace given by (A 1} . . . ,A k ; a): it is an operator T : (C n )^ r+S ^ (C")®( r+S ) of 
the form 

k 

(61) T = H(A T[i) ®I n s)P®, 

i=i 

where r G S k is a permutation encoding the order of the matrices Ai in the expansion, P® are 
tensor permutation matrices acting on (C n )®( r+S ), with pi G 5 r +s- Figure 10(c) contains an 
input-output presentation of the generalized trace appearing in ([57]) . The integer s ^ is called 
the overhead of the presentation. We ask for such a presentation to encode the generalized trace 

Tr(T) = Tr 
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Such representations have the advantage that they readily give bounds 

|Tr(T)| < ||T||i ^ n r+s ||T|U 



^n r+s []p T(i) ®I nS ||oo||P p tll°o 
i=i 
k 

(62) ^n r+s J] || AiWoo 

i=i 

We shall prove, by induction on the number dist(<r, T), that one can find an input-output 
presentation of a generalized trace with an overhead s = dist(c, T). This is the key idea of the 
proof. 

Let us first consider the case when a G T. This means that, up to permutations pi G S r 
of the legs of each individual box Ai, the generalized trace is a trace of the product of all the 
matrices Ai, in some specific order given by a permutation r. In other words, one has 

k 

Tr a (A 1 ,...,A k ) = HA T(i) P^, 
i=i 

and this is an input-output presentation of a, without any overhead. This proves the initializa- 
tion step of the induction, dist(er, T) = 0. 

The inductive step of our claim is a consequence of the following lemma. 

Lemma A. 2. If a permutation a G St r admits an input-output presentation with overhead s 
and £ is a transposition, then cr£ admits an input-output presentation with overhead s + 1. 

Proof. Let £ = ((ji, yi), (j2, 2/2)) be the transposition in the statement. If j\ = ji, there is 
nothing to show, since the transposition £ can be absorbed in the tensor permutation matrix 
P®-iq\ in the presentation of a. This way, one obtains a presentation for <r£, with overhead s 
(hence it is possible with overhead s + 1). 

Suppose now ji ^ j'2 and let T be a presentation for a, as it is defined in (fUTj) . Let also 
ki = T _1 (ji) and ki = r _1 (j2). The presentation T can not be used directly to produce an 
input-output presentation for c£, since one of the legs of the matrix A^ needs to be connected to 
a different permutation and this is not allowed in the definition of an input-output presentation, 



see Figure 11(b) 



We shall modify the presentation T for a, by adding a unit of overhead, into a presentation 
T for cr£, as follows: 

k 

r = Yl(A T , (i) ®i n s +1 )p®, 
1=1 

with 

• t' = t : the order of the matrices Ai does not change; 

• For all i £ {ki, k%}, p'i(x) = p(x) Vx G [r + s] and p[(r + s + 1) = r + s + 1 : for all boxes 
not affected by £, the tensor permutation matrices are the same; 

• Pfci^i) = r + s + l, p' ki (r + s + l) = pkiilli) and p' kl (x) = pk x (x) for all x £ {yi,r + s + l}; 

• P'k 2 (y2) = r + s + 1, p' k2 (r+s+l) = p fc2 (y 2 ) and p' k2 (x) = Pk 2 {x) for all x £ {y2,r + s + l}. 
In other words, the extra copy of the space C" is used to implement the transposition £ in 

a way compatible with the input-output structure of the presentation T, see Figure [11] for a 
graphical representation of this procedure. 

□ 
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Figure 11. Input-output presentations for generalized traces. A presentation 
for a is depicted in the top figure. The center figure corresponds to cr^, but 
it is not an input-output presentation for this permutation, since it contains 
wires which do not connect consecutive blocks. The bottom figure depicts the 
presentation T', obtained by adding an extra copy of C™ and implementing the 
transposition £ through this space. 



Using the above lemma, one can produce an input-output presentation of a permutation 
inductively from a decomposition 

where 7 G T, s = dist(cr, T) and £j are transpositions. Start with a zero-overhead presentation 
of 7 and add the transpositions £$, each adding a unit of overhead to the presentation. At the 
end, one obtains a presentation for a with overhead dist(<r, T), and the L°° bound (|59p follows. 

(c) We now move to the I? bound (|60p . The proof strategy is exactly the same as in the L°° 
case and we sketch only the differences. We introduce two-column presentations as operators 
S : (C n )®( fcr / 2+s ) ->. (C™)®(W 2 + S ) of the form 

2 

(63) S = n 

3=1 

where t\,ti : [k/2] — > [k] encode the order of the matrices Ai in the presentation, and P® are 
tensor permutation matrices acting on (C n )®( fer / 2 + s ) ) with pj £ Skr/2+s- Figure [121 contains 
a two-column presentation of some generalized trace. Again, the integer s ^ is called the 



/A/2 
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overhead of the presentation. As before, we ask that a presentation encodes the generalized 
trace, for any choice of A{\ 

k 

Tr{S) = Tr f I K 




Figure 12. A two-column presentation with overhead s = 2 of a generalized 
trace, with r = 4. 

Using the Cauchy-Schwarz inequality, one can obtain the following bound: 



|Tr(5)| < 



>A Tl{i) ®I n s I P£ 



1=1 

k/2 fc/2 

^n s / 2 nPnwii2^ /2 nn^«ii2 



1=1 



i=l 



(64) 



i=l 



Since permutations a G admit two-column presentations with overhead s = 0, the bound 
(|60p holds in this case. For general a, one proceeds as in the L°° case, using Lemma IA.2I for 
each transposition £j appearing in the decomposition 

* = • • • 6, 

where 9 G O. □ 

Remark A. 3. T/ie case r = 1 of the L°° bound (|59|) is a trivial corollary of the result in |23j . 
since t/ie forest associated to the graph contains #a trivial trees. 
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